Describing Spoken Dialogue Systems Differences
نویسندگان
چکیده
Conventional metrics in evaluating task-oriented Spoken Dialogue Systems (SDSs) are objective metrics such as the mean number of turns, or the estimated success rate. We provide further empirical evidence that these metrics are unreliable to distinguish across SDS: they fail to distinguish changes in the system, and detect differences where there are none. For instance, we report that the mean estimated success can have statistically significant differences (at the 5% significance level) on exactly the same system, over different, contiguous periods of time. We propose Dialogue System Difference Finder (DSDF), a novel model which can explain that the differences found using conventional metrics are due to seasonal and usage characteristics. DSDF is able to describe differences between multiple SDS, and it is different from traditional performance metrics used in the SDS community, in that it is more sensitive to changes among systems, has lower variance, and can also explain what the differences consist of. We used this model to find unexpected changes in our data sets. We believe that this lays the groundwork toward building a fully-automatic metric.
منابع مشابه
Studies on Robust Language and Dialogue Processing for Spoken Dialogue Systems
In spoken dialogue systems, robust language processing for spontaneous speech understanding and robust dialogue processing for achieving user goal are inevitable. Previously, research of speech recognition and research of natural language understanding were done independently. At first glance, it seems to be no problem to combine these two technologies, because the purpose of speech recognition...
متن کاملDesigning a domain independent platform of spoken dialogue system
Our purpose is to construct a domain independent platform for task-oriented spoken dialogue system. It is expected to generalize architecture of spoken dialogue system for making it easy to develop a system that can support and act to solve problems for user by using speech input and output. In this research, we propose a method of describing hierarchically internal knowledge for solving proble...
متن کاملA new Taxonomy for the Quality of Telephone Services Based on Spoken Dialogue Systems
This document proposes a new taxonomy for describing the quality of services which are based on spoken dialogue systems (SDSs), and operated via a telephone interface. It is used to classify instrumentally or expert–derived dialogue and system measures, as well as quality features perceived by the user of the service. A comparison is drawn to the quality of human–to–human telephone services, an...
متن کاملRobust and adaptive architecture for multilingual spoken dialogue systems
We present how robustness and adaptivity can be supported by the spoken dialogue system architecture. AthosMail is a multilingual spoken dialogue system for e-mail domain. It is being developed in the EU-funded DUMAS project. It has flexible system architecture supporting multiple components for input interpretation, dialogue management and output generation. In addition to language differences...
متن کاملAn educational dialogue system with a user controllable dialogue manager
We have developed an educational environment for a modular spoken dialogue system. The aim of the environment is to provide students, with different backgrounds, means to understand the behaviour of spoken dialogue systems. Focus in this paper is on dialogue and dialogue management. The dialogue is recorded in a dialogue tree whose nodes are dialogue objects. The dialogue objects model the cons...
متن کامل